Words containing ,:
Rank in Wordlist | Frequency | Word |
---|---|---|
2791 | 6 | १,००० |
6171 | 3 | २,००० |
6580 | 2 | आहे,अशी |
10571 | 2 | हवी,’ |
10666 | 2 | १२,००० |
10721 | 2 | ३,००० |
10733 | 2 | ५,००० |
10764 | 1 | 100,000/- |
10776 | 1 | 15,000 |
10826 | 1 | 3,97,559 |
Words containing (:
Rank in Wordlist | Frequency | Word |
---|---|---|
10883 | 1 | CH3—CO—C(CH3)3 |
14854 | 1 | कमेरुँ(यू |
20426 | 1 | ढोणी(कर्णधार |
23330 | 1 | निर्वाहा(सुखा)करितां |
23974 | 1 | पदनिष्ठ(म्हणजे |
25746 | 1 | प्रमाण(क्षारता |
26116 | 1 | फ(ट |
26117 | 1 | फ(र |
26144 | 1 | फप(क्ष |
26145 | 1 | फम(क्ष |
Words containing ):
Rank in Wordlist | Frequency | Word |
---|---|---|
3495 | 4 | आहे)’ |
10883 | 1 | CH3—CO—C(CH3)3 |
15291 | 1 | कांदा)बारीक |
17890 | 1 | घ.मी.). |
20272 | 1 | डी)नुसार |
23330 | 1 | निर्वाहा(सुखा)करितां |
25625 | 1 | प्रतिनिधी)ः |
28824 | 1 | महासंचालक(आस्थापना)मुंबई |
38054 | 1 | १९५५),विथ |
38082 | 1 | २)उत्स्फूर्त |
Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
6149 | 3 | १०% |
6178 | 3 | ५% |
6179 | 3 | ५०% |
10715 | 2 | २०% |
10725 | 2 | ३३% |
10737 | 2 | ६% |
10738 | 2 | ६०% |
10741 | 2 | ७% |
10743 | 2 | ८०% |
10828 | 1 | 30% |
Words containing &:
Rank in Wordlist | Frequency | Word |
---|---|---|
10963 | 1 | S&P-500 |
Words containing ':
Rank in Wordlist | Frequency | Word |
---|---|---|
11707 | 1 | अपयश'हैदराबाद |
11818 | 1 | अभियान'चे |
11952 | 1 | अर्थ'च |
13144 | 1 | आहे,''असे |
14281 | 1 | एटीएस'प्रमुख |
16199 | 1 | केसरी'त |
16504 | 1 | क्राईम'ची |
17538 | 1 | गुफ्तगू'चा |
19147 | 1 | जाईल'सेंच्युरियन |
19869 | 1 | टाळा'असा |
Words containing /:
Rank in Wordlist | Frequency | Word |
---|---|---|
10664 | 2 | १/३ |
10764 | 1 | 100,000/- |
10765 | 1 | 1000/- |
11894 | 1 | अमेरिकेत/इतरत्र |
13033 | 1 | आवक/जावक |
13149 | 1 | आहे/ |
14754 | 1 | कथात्मक/काव्यात्मक |
14982 | 1 | करें/ऐसा |
18640 | 1 | चेटूक/चेटकीण |
19236 | 1 | जाती/जमाती |
In the last subsection of this type, we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words while others are not. If words containing forbidden characters do not all have very low frequency, this may indicate a problem in preprocessing.
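The frequency check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the set of forbidden characters, and the threshold `min_freq` are assumptions made for this example, since the report does not state which characters are forbidden for a given language or what counts as "very low" frequency.

```python
from collections import defaultdict


def flag_suspicious(words, forbidden, min_freq=3):
    """Group words by the forbidden character they contain.

    words     -- iterable of (word, frequency) pairs
    forbidden -- characters assumed not to be allowed inside words
    min_freq  -- hypothetical threshold; words at or above it are reported
    """
    flagged = defaultdict(list)
    for word, freq in words:
        if freq < min_freq:
            continue  # very rare words are treated as harmless noise
        for ch in forbidden:
            if ch in word:
                flagged[ch].append((word, freq))
    return dict(flagged)


# A few rows taken from the tables in this section.
sample = [
    ("१,०००", 6),
    ("२,०००", 3),
    ("आहे,अशी", 2),
    ("१०%", 3),
    ("S&P-500", 1),
]

# Assumption for the demo: "," and "&" are forbidden, "%" is allowed.
for ch, hits in flag_suspicious(sample, {",", "&"}).items():
    print(ch, hits)
```

With these assumed settings only the comma-separated numbers are reported, which would suggest checking how the preprocessing handles digit grouping.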
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
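The same query can be repeated for each character in the list. The sketch below does this against an in-memory SQLite database, so it runs without access to the corpus database. The column names and the `w_id` offset of 100 are taken from the query above. The toy rows, the helper names `like_pattern`, `words_containing`, and `build_demo_db`, and the use of SQLite are assumptions for illustration. The point to note is that `%` and `_` are LIKE wildcards and must be escaped. The backslash in the query above relies on the engine's default escape character, whereas SQLite needs an explicit `ESCAPE` clause.

```python
import sqlite3

SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(ch):
    """Return a LIKE pattern matching words that contain ch literally."""
    # % and _ are wildcards and the backslash is the escape character,
    # so these three need an escape prefix.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"


def words_containing(conn, ch, limit=10):
    """Return up to `limit` rows of (rank, freq, word) for words containing ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()


def build_demo_db():
    """Create a toy `words` table holding a few rows from this section."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    rows = [
        (5, "%", 0),          # hypothetical entry with w_id <= 100, skipped by the query
        (150, "आहे", 40),      # hypothetical plain word without special characters
        (2891, "१,०००", 6),
        (6249, "१०%", 3),
        (6278, "५%", 3),
        (10764, "१/३", 2),
        (11063, "S&P-500", 1),
    ]
    conn.executemany("INSERT INTO words VALUES (?, ?, ?)", rows)
    return conn


if __name__ == "__main__":
    conn = build_demo_db()
    for ch in SPECIAL_CHARS:
        hits = words_containing(conn, ch)
        if hits:
            print(f"Words containing {ch}:", hits)
```

Passing the pattern as a bound parameter avoids quoting problems for the `"` and `'` cases, which would otherwise need separate handling inside the SQL string.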